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The combination of space-time coding (STC) and continuous phase modulation (CPM) is an attractive field of 
research because both STC and CPM bring many advantages for wireless communications. Zhang and Fitz [Tl were 
the first to apply this idea by constructing a trellis based scheme. But for these codes the decoding effort grows 
exponentially with the number of transmitting antennas. This was circumvented by orthogonal codes introduced by 
00 , Wang and Xia [2J. Unfortunately, based on Alamouti code [3], this design is restricted to two antennas. 

■ However, by relaxing the orthogonality condition, we prove here that it is possible to design L^-orthogonal space- 
time codes which achieve full rate and full diversity with low decoding effort. In part one, we generalize the two- 
antenna code proposed by Wang and Xia 2\ from pointwise to L^-orthogonality and in part two we present the first 
^ L^-orthogonal code for CPM with three antennas. In this report, we detail these results and focus on the properties 
of these codes. Of special interest is the optimization of the bit error rate which depends on the initial phase of the 
system. Our simulation results illustrate the systemic behavior of these conditions. 
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I — ^; 1 Part one: Two antennas case 



t~H ' To combine the high power efficiency of Continuous Phase Modulation (CPM) with either high spectral efficiency or 
C/3 . enhanced performance in low Signal to Noise conditions, some authors have proposed to introduce CPM in a MIMO 
I frame, by using Space Time Codes (STC). In this part, we address the code design problem of Space Time Block 
, Codes combined with CPM and introduce a new design criterion based on orthogonality. This orthogonality 
condition, with the help of simplifying assumption, leads, in the 2x2 case, to a new family of codes. These codes 
; generalize the Wang and Xia code, which was based on pointwise orthogonality. Simulations indicate that the new 
' codes achieve full diversity and a slightly better coding gain. Moreover, one of the codes can be interpreted as two 
f — . : antennas fed by two conventional CPMs using the same data but with different alphabet sets. Inspection of these 
' alphabet sets lead also to a simple explanation of the (small) spectrum broadening of Space Time Coded CPM. 

\6 

O 1.1 Introduction 
OO 

' Since the pioneer work of Alamouti [3J and Tarokh [3] , Space Time Coding has been a fast growing field of research 
I where numerous coding schemes have been introduced. Several years later Zhang and Fitz [H [5] were the first to 
k>( ^PP^y idea, of STC to continuous phase modulation (CPM) by constructing trellis codes. In [5] Zajic and Stiiber 
rN _ derived conditions for partial response STC-CPM to get full diversity and optimal coding gain. A STC for noncoherent 
' detection based on diagonal blocks was introduced by Silvester et al. [7]. 

The first orthogonal STC for CPM for full and partial response was developed by Wang and Xia [51 [5] . The scope 
of this part is also the design of an orthogonal STC for CPM. But unlike Wang-Xia aprroach [2] which starts from a 
QAM orthogonal Space-Time Code (e.g. Alamouti's scheme [3]) and modify it to achieve continuous phases for the 
transmitted signals, we show here that a more general LP' condition is sufficient to ensure fast maximum likelihood 
decoding with full diversity. 

In the considered system model (Fig. [T]), the data sequence dj is defined over the signal constellation set 

Vld = {-M + 1, -M + 3, . . . , M - 3, M - 1} (1) 

for an alphabet with logj M bits. To obtain the structure for a Space Time Block Code (STBC) this sequence is 
mapped to data matrices D*-*) with elements dmr, where m denotes the transmitting antenna, r the time slot into a 
block and (z) a parameter for partial response CPM. The data matrices are then used to modulate the sending matrix 



S(t) 



Sll(t) Si2(i) 
S2l{t) S22{t) 



(2) 
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Figure 1: Structure of a MIMO Tx/Rx system 



Each element is defined for (2/ + r - l)r < i < {21 + r)T as 

= y^e^-2-^"-W (3) 

where Eg is the symbol energy and T the symbol time. The phase 4imr{i) is defined in the conventional CPM manner 
[S] with an additional correction factor Cmr{t) and is therewith given by 

2l+r 

4>mr{t) = em{2l + r) + /l ^ d« q(i - {l - l)T) + C™,(i) (4) 
i=2i+l+r-7 

where h — 2mo/p with mp and p relative primes is called the modulation index. The phase smoothing function q{t) 
has to be a continuous function with q{t) = for i < and q{t) = 1/2 for t > jT. 

The memory length 7 determines the length of q{t) and affects the spectral compactness. For large 7 we obtain 
a compact spectrum but also a higher number of possible phase states which increases the decoding effort. For full 
response CPM, we have 7 = 1 and for partial response systems 7 > 1. 

The choice of the correction factor Cmr{t) in Eq. ^ is along with the mapping of dj to D^*-', the key element in 
the design of our coding scheme. It will be detailed in Section [TT^ We then define 6'm(2Z + r) in a most general way 

9^{2l + 3) = ern{2l + 2) + ^(2; + 2) = Orai^l + 1) + ^(2/ + 1) + ^{21 + 2). (5) 

The function ^(2Z + r) will be fully defined from the contribution Cmr{t) to the phase memory 0^(21 + r). For 
conventional CPM system, Cmr{t) = and we have 1^(2^ + 1) = ■|d2i+i-7- 

The channel coefficients amn are assumed to be Rayleigh distributed and independent. Each coefficient a,„„ char- 
acterizes the fading between the m*'' transmit (Tx) antenna and the n'^ receive (Rx) antenna where n = 1,2, Lr- 
Furthermore, the received signals 

yn{t) = a„inSrnr{t) + n{t) (6) 

are corrupted by a complex additive white Gaussian noise n{t) with variance 1/2 per dimension. 

At the receiver, the detection is done on each of the Lr received signals separately. Therefore, in general, each 
code block S(t) has to be detected by block. E.g. for a 2x2 block, estimating the symbols dj implies computational 
complexity proportional to Af ^ . Now, this complexity can be reduced to 2M by introducing an orthogonality property 
as well as simplifying assumptions on the code. 

Criteria for such STBC are given in Section 11.21 In Section 11.31 the criteria are used to construct OSTBC for 
CPM. In Section we test the designed code and compare it with the STC from Wang and Xia Q^. Finally, some 
conclusions are drawn in Section 11.51 

1.2 Design Criteria 

The purpose of the design is to achieve full diversity and a fast maximum likelihood decoding while maintaining 
the continuity of the signal phases. This section shows how the need to perform fast ML decoding leads to the 
orthogonality condition as well as to simplifying assumptions, which can be combined with the continuity conditions. 
For convenience we only consider one Rx antenna and drop the index n in amn- 

1.2.1 Fast Maximum Likelihood Decoding 

Commonly, due to the trellis structure of CPM, the Viterbi algorithm is used to perform the ML demodulation. On 
block / each state in the trellis has incoming branches and outgoing branches with a distance 
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(2/+l)T (2/+2)T 

y(i) - ^ a„s„i(t) dt+ / - ^ a™s,„2(i) di. (7) 

2;t "^-^ (2;+i)T ™"-^ 

The number of branches results from the blockwise decoding and the correlation between the sent symbols sir{t) and 
S2r{t)- A way to reduce the number of branches is to structurally decorrelate the signals sent by the two transmitting 
antennas, i.e. to put to zero the inter-antenna correlation 

(2i+l)T (2i + l)T (2i+2)T (2;+2)T 

a2al / S2i{t)s\^{t)At + aial sii{t)sl^{t)At + a2a\ S22{t)s\^{t)dt + aial s (8) 



21T 21T (2i+l)T (2i+l)T 

Pointwise orthogonality as defined in is therefore a sufficient condition but not necessary. A less restrictive 
orthogonality is also sufficient. From Eq. ([8]), the distance given in Eq. ([7|) can then be simplified to 

(2i+l)T (2i+2)T 

A= Jfii{t) + f2iit)-\y{t)\'dt+ I fi2{t) + f22{t) - \y{t)\' dt (9) 

21T (2i+l)T 

with fmr{t) = \y{t) ~ cxmSmr{t)\'^ ■ When cach Srnr(i) depends only on (i2i+i or d2i+2 the branches can be split and 
calculated separately for d2i+i and d2i+2- The complexity of the ML decision is reduced to 2M . The complexity for 
detecting two symbols is thus reduced from pM'^'^^ to pM'^ . The STC introduced by Wang and Xia [5] didn't take 
full advantage of the orthogonal design since Smr{t) was depending on both d2i+i and d2i+2- The gain they obtained 
in [2] was then relying on other properties of CPM, e.g. some restrictions put on q{t) and p. These restrictions may 
also be applied to our design code, which would lead to additional complexity reduction. 

1.2.2 Orthogonality Condition 

r 2 / ~1~ 2 1 T 

In this section we show how L2 orthogonality for CPM, i.e. I|S(t)||i^ = hiT S{t)S"{t)dt = 21, can be obtained. 
As such, the correlation between the two transmitting antennas per coding block is canceled if 

(2i+2)T (2i+l)T (2i+2)T 

Slrit)s*2^it)dt= J Siiit)s*2iit) dt + J Si2it)s*22it)dt = 0. (10) 
21T 21T (2; + l)T 

Replacing Smr{t) by the corresponding CPM symbols from Eq. we get 

+ 21+1 21+2 



exp|i27r [di {21 + 1) + hY^d['\q{t - (i - 1)T) + Ci,i (t) - 6'2(2Z + 1) - ft ^ d^'lqit - (i - l)r) - C2,i (t)] | dt+ 

i=2l+2-i i=2l+3-i 
' 21+2 21 + 1 

exp|j27r[e»i(2Z + 2) + h^d^qit - {i - l)T)+ci,2(t) - 62(21 + 2)~-hJ2 d%^^ q{t - iT) - C2,2{t)] dtj = 0. (11) 



I' 



(2l+l)T i=2i+3-7 i=2;+2-7 



The phase memory 0m (2? + r) is independent of time and has not to be considered for integration. Using Eq. ([5]) to 
replace phase memory 0„i{2l + 2) of the second time slot, we obtain 



21 + 1 21 + 1 



I 



exp I j27r [h d^\q{t - [i - 1)T) + Ci,i (t) - ft ^ 4'U(* - (« ^ 1)^) - C2,i (t)] j dt + exp | j27r [Ci [21 + 1) - (2Z + 1)] | • 

=2;+2-7 i=2;+2 — / 

, ' 21+1 21 + 1 

exp |j27r[/i Y d^i^^^q{t - (i - 1)T) + ci,2(t + T)-hJ2 di';^'''q{t - {i - 1)T) - C2,2(t + T)] | dt = 0. (12) 



I' 



i=2i+2 — f i=2!+2-7 



1.2.3 Simplifying assumptions 

To simplify this expression, we factor Eq. (|12p into a time independent and a time dependent part. For merging the 
two integrals to one time dependent part, we have to map (i|^2 to and Cmr{t) to a different Cm'r'{t)- Consequently, 
for the data symbols dmr there exist three possible ways of mapping: 
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• crosswise mapping with di \ ~ ^3*2 and d^i \ — rfj'i; 

• repetitive mapping with di \ — d^i \ and x — '^2*2 i 

• parallel mapping with d^^\ — d^2 \ ^1*2 ~ ^2*2 • 
The same approach can be apphed to Cmr{t)- 

• crosswise mapping with cii(t) = — C22(i — T) and Ci2{t) — — C2i(t — T); 

• repetitive mapping with cii(t) = ci2(/: — T) and C2i(i) = C22{t — T); 

• parallel mapping with cii(t) = C2i(t) and Ci2{t) — C22{t). 



For each combination of mappings, Eq. (jl2p is now the product of two factors, one containing the integral and the 
other a time independent part. To fulfill Eq. (fT^ it is sufficient if one factor is zero, namely l + e'^'^['^i(^'+^^~^^'^^'+^)l = 
0, i.e. if 

+ 6(2^ + 1) -6(2^ + 1) (13) 
with fc G N. We thus get a very simple condition which only depends on £,m{2l + 1). 

1.2.4 Continuity of Phase 

In this section we determine the functions ^,n(2^ + 1) to ensure the phase continuity. 

Precisely, the phase of the CPM symbols has to be equal at all intersections of symbols. For an arbitrary block I, 
it means that (j),ni{{2l + l)T) = (/),„2((2/ + 1)T). Using Eq. it results in 

2; + l 2/+2 

U{21 + 1) = l^y^^{2l + 2 - i)T) + c™,i((2/ + l)r) - ;i^4^29((2/ + 2 - i)T) - c„,2((2/ + l)r). (14) 

i=2;+2~7 i=2J+3-7 

For the second intersection at [21 + 2)T , since (t>m2{{2l + 2)T) = (t>,ni{{2l + 2)T), we get 

21+2 2(/+l)+l 

U(2/ + 2)^h Y,d%{{2l + 3 - i)T) + Cra2{i2l + 2)T) - ^ rf™,i9((2; + 3 - i)T) - c„,a((2/ + 2)T). (15) 

i=2;+3-7 i=2(; + l)+2 — 1 

Now, by choosing one of the mappings detailed in Section [T31 these equations can be greatly simplified. Hence, 
we have all the tools to construct our code. 

1.3 Orthogonal Space Time Codes 

In this section we will have a closer look at two codes constructed under the afore- mentioned conditions. 
1.3.1 Existing Code 

As a first example, we will give an alternative construction of the code given by Wang and Xia in ^2j. Indeed, the 
pointwise orthogonality condition used by Wang and Xia is a special case of the orthogonality condition, hence, 
their ST-code can be obtained within our framework. 

For the first antenna Wang and Xia use a conventional CPM with d\^ = for i = 2? + r + 1 — 7, 2Z + r + 2 — 
7, . . . , 2Z + r and cir{t) = 0. The symbols of the second antenna are mapped crosswise to the first d2l = — rfj+i for 
i = 2; + 2 - 7, 2; + 3 - 7, . . . , 2? + 1 and 4^^^ = -d.L_i for i = 2? + 3 - 7, 2? + 4 - 7, . . . , 2/ + 2. Using this cross 
mapping makes it difficult to compute ^rn{2l + 1) since the CPM typical order of the data symbols is mixed. Wang 
and Xia circumvent this by introducing another correction factor for the second antenna 

7-1 

C2r(i) ^J2^h{d2l + l-^ + d2l+2-^) + l)go(t - {21 + V - I - i)T) . (16) 
1=0 

By first computing £,m(2l + 1) with Eq. (fT7|) and then Eq. (fT3|) . we get the orthogonality of the Wang-Xia-STC. 
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Figure 2: Left: Simulated BER for different numbers of Tx and Rx antennas of the proposed STC and of the Wang-Xia-STC; 
Right: Simulated psd for each Tx antenna of the proposed STC (continuous line) and the Wang-Xia-STC (dashed line) 



1.3.2 Parallel Code 

To get a simpler correction factor as in [5] , we designed a new code based on the parallel structure which permits to 
maintain the conventional CPM mapping for both antennas. Hence we choose the following mapping: = (f^^^ = 
for i = 2/ + r + 1 - 7, 2? + r + 2 - 7, . . . , 2; + r. Then, Eq. (HH) and can be simplified into 

+ = ^d2i+2-7 + Cmi((2/ + l)T)-c„,2((2? + l)r)6„(2Z + 2) = :^rf2;+3-7 + Cm2((2Z + 2)r)-c„.i((2/ + 2)T). (17) 

With this simplified functions, the orthogonality condition only depends on the start and end values of Crnr{t)^ i-e. 

fc + i - cn((2; + l)T) - ci2{i2l + 1)T) - C2i((2/ + l)r) + C22((2; + l)r). (18) 

To merge the two integrals in Eq. the mapping of dmr is necessary but also an equality between different Cmr(t). 

From the three possible mappings, we choose the repeat mapping because of the possibility to set Cmr{t) to zero for 
one antenna. Hence we are able to send a conventional CPM signal on one antenna and a modified one on the second. 
Using Eq. psp and the equalities for the mapping, we can formulate the following condition 

fc -f i = Ci2(2/r) - Ci2((2/ + 1)T) - C22(2/r) + C22((2/ + 1)T). (19) 

With cii(i) — Ci2{t) = 0, we can take for C2i(i) — C22{t) any continuous function which is zero aXt = Q and 1/2 at 
t = T. Another possibility is to choose the correction factor of the second antenna with a structure similar to CPM 
modulation, i.e. 

2i-f 1 

C2.(i)= E 9(i-(«-i)r) (20) 

for {21 + r — 1)T < t < {21 + r)T . With this approach, the correction factors can be included in a classical CPM 
modulation with constant offset of 1/h. This offset may also be expressed as a modified alphabet for the second 
antenna 

-{-M + l + -i,-M + 3 + i ...,M-3 + -i,M-l + i}. (21) 

Consequently, this L^-orthogonal design may be seen as two conventional CPM designs with different alphabet 
sets VLd and for each antenna. However, in this method, the constant offset to the phase may cause a shift in 
frequency. But as shown by our simulations in the next section, this shift is quite moderate. 

1.4 Simulations 

In this section we verify the proposed algorithm by simulations. Therefore a STC-2REC-CPM-sender with two 
transmitting antennas has been implemented in MATLAB. For the signal of the first antenna we use conventional 
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Gray-coded CPM with a modulation index h = 1/2, the length of the phase response function 7 = 2 and an alphabet 
size of M = 8. The signal of the second antenna is modulated by a CPM with the same parameters but a different 
alphabet ild2 1 corresponding to Eq. (PT|) . 

The channel used is a frequency flat Rayleigh fading model with additive white Gaussian noise. The fading 
coefficients amn are constant for the duration of a code block (block fading) and known at receiver (coherent detection). 
The received signal yn{t) is demodulated by two filterbanks with pM^ filters, which are used to calculate the correlation 
between the received and candidate signals. Due to the orthogonality of the antennas each filterbank is independently 
applied to the corresponding time slot k of the block code. The correlation is used as metric for the Viterbi algorithm 
(VA) which has pM states and M paths leaving each state. In our simulation, the VA is truncated to a path memory 
of 10 code blocks, which means that we get a decoding delay of 2 • lOT. 

From the simulation results given in Figure [H we can reasonably assume that the proposed code achieves full 
diversity. Indeed, the curves for the 2x1 and 2x2 systems respectively show a slope of 2 and 4. Moreover, the curve of 
the 2x1 systems follows the same slope as the ST code proposed by Wang and Xia [2] , which was proved to have full 
diversity. Note also that the new code provides a slightly better performance. 

A main reason of using CPM for STC is the spectral efficiency. Figure [2] show the simulated power spectral density 
(psd) for both Tx antennas of the proposed ST code (continuous line) and the ST code proposed by Wang and Xia [2] . 
The first antenna of our approach uses a conventional CPM signal and hence shows an equal psd. The spectrum of 
the second antenna is shifted due to adding an offset Cmr{t) with a non zero mean. Minimizing the difference between 
the two spectra by shifting one, result in a phase difference of 0.375 measured in normalized frequency / • T^, where 
Td = r/log2(M) is the bit symbol length. The first antenna of the Wang-Xia-algorithm has almost the same psd 
while the spectrum of the second antenna is shifted by approximately 0.56/ • Td- This means that the OSTC by Wang 
and Xia requires a slightly larger bandwidth than our OSTC. 

1.5 Conclusion to part one 

In applications where the power efficiency is crucial, combination of Continuous Phase Modulation and Space Time 
Coding has the potential to provide high spectral efficiency, thanks to spatial diversity. To address this power efficiency, 
ST code design for CPM has to ensure both low complexity decoding and full diversity. To fulfill these requirements, 
we have proposed a new orthogonality condition. We have shown that this condition is sufficient to achieve low 
complexity ML decoding and leads, with the help of simplifying assumption to a simple code. Moreover, simulations 
indicate that the code most probably achieves full diversity. In the next part of this report, we will concentrate on 
the design of other codes based on orthogonality and will show how to design full diversity, full rate orthogonal 
codes for 3 antennas. 

2 Part two: Extension to more antennas 

To combine the power efficiency of Continuous Phase Modulation (CPM) with enhanced performance in fading envi- 
ronments, some authors have suggested to use CPM in combination with Space-Time Codes (STC). In part one, we 
have proposed a CPM ST-coding scheme bases on L^-orthogonality for two transmitting antennas. In this part we ex- 
tend this approach to the three antenna case. We analytically derive a family of coding schemes which we call Parallel 
Code (PC). This code family has full rate and we expect that the proposed coding scheme achieves full diversity. This 
is confirmed by accompanying simulations. We detail an example for the proposed STC which can be interpreted as 
a conventional CPM scheme with different alphabet sets for the different transmit antennas which results in simpli- 
fied implementation. Thanks to L^-orthogonality, the decoding complexity, usually exponentially proportional to the 
number of transmitting antennas, is reduced to linear complexity. 

2.1 Introduction 

To overcome the reduction of channel capacity caused by fading, Telatar [TUj, Foschini and Gans [Tl] described in the 
late 90s the potential gain of switching to multiple input multiple output (MIMO) systems. These results triggered 
many advances mostly concentrated on the coding aspects for transmitting antennas, e.g. Alamouti f31 and Tarokh et 
al. [1] for Space-Time Block Codes (STBC) and also Tarokh et al. ^12J for Space-Time Trellis Codes. 

Zhang and Fitz [HIS] were the first to apply the idea of STC to CPM by constructing trellis codes. In [7], Silvester 
et al. derived a diagonal block space-time code which enables non-coherent detection. A condition for optimal coding 
gain while sustaining full diversity was also recently derived by Zajic and Stiiber [5]. 

Inspired by orthogonal design codes, Wang and Xia introduced in [S] the first orthogonal STC for two transmitting 
antennas and full response CPM and later in [2] for partial response. Their approach was extended in |13j to construct 
a pseudo-orthogonal ST-coded CPM for four antennas. To avoid the structural limitation of orthogonal design, 
we proposed in |l4j a STC CPM scheme based on orthogonality for two antennas. Sufhcient conditions for 
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Figure 3: Structure of a MIMO Tx/Rx system 

orthogonality were described, orthogonal codes were introduced and the simulation results displayed good 
performance and full rate. Here, motivated by this results, we extend our previous work and generalize these conditions 
for three transmitting antenna. 

The main result of the three transmit antenna case, is that it can, unlike the codes based on orthogonal design, 
achieve full diversity with a full rate code: 



1. 



the full rate property is one of the main advantage of using the norm criterion, instead of merely extending 
the classical Tarokh [5 orthogonal design to the CPM case. Indeed, in the classical orthogonal design approach, 
which is based on optimal decoding for linear modulations, the criterion is expressed as the orthogonality between 
matrices of elements, each of these elements being a definite integral (usually the output of a matched filter). 
On the contrary, in the design approach used for non-linear modulations, the product in Eq. (|29p is a definite 
integral itself, the integrand being the product of two signals. This allows more degrees of freedom and enables 
the full rate property. 

2. the full diversity property can be proved in a similar way to the classical case, with the help of the extensions 
proposed by Zajic and Stiiber 6J. 

Furthermore, it should be pointed out that the proposed coding scheme does not limit any parameter of the CPM. 
It is applicable to full and partial response CPM as well as to all modulation indexes. 

We first give the system model for a multiple input multiple output (MIMO) system with Lt transmitting (Tx) 
antennas and Lr receiving (Rx) antennas (Fig. Later on, we will use this general model to derive a L2-0STC for 
CPM for Lt — 3. The emitted signals s{t) are mixed by a channel matrix A of dimension Lr x Lt- The elements 
of A, an,m, are Raylcigh distributed random variables and characterize the fading between the n*'* Rx and the m*'' 
Tx antenna. The Tx signal is disturbed by complex additive white Gaussian noise (AWGN) with variance of 1/2 per 
dimension which is represented by a x Lt matrix n(t). The received signal 



y(t) = As(t) + n(i). 

has the elements yn^r and the dimension Lr x Lt- We group the transmitted CPM signals into blocks 

SlA{t) ... Sl,Lt(t) 



s{t) = 



(22) 



(23) 



similar to a ST block code with the difference that now the elements are functions of time and not constant anymore. 
The elements of Eq. ([25)1 are given by 



At) 



LtT 



exp {j2'K(j)mAt)) 



(24) 



for {Ltl + r — 1)T < t < {Ltl + r)T and m, r = 1, 2, . . . , Lt. Here m represents the transmitting antenna and r the 
relative time slot in the block. The symbol energy Eg is normalized to the number of Tx antennas Lt and the symbol 
length T. The continuous phase 



4>mAt) = SALtl + r) + /i ^ d!i$q{t - I'T) + : 



A) 



(25) 



is defined similarly to [3] with an additional correction factor Cm,rit) detailed in Section 12.2.31 Furthermore, I is 
indexing the whole code block, i the overlapping symbols for partial response and i' = Ltl + r — i. With this extensive 
description of the symbol dm,r, we are able to define all possible mapping schemes (cp. Section [2. 2. 2p . The modulation 



7 



index h ~ 2mo/p is the quotient of two relative prime integers mo and p and the phase smoothing function q{t) has 
to be continuous for < t < jT, for t < and 1/2 < jT. The memory length 7 gives the number of overlapping 
symbols. 

To maintain continuity of phase, we define the phase memory 

9,n{Ltl + r + 1) - e^{Ltl + r)+ U{Ltl + r) (26) 

in a general way. The function £,{Ltl + r) will be fully defined in Section [2.2.31 from the contribution of Cmr{t)- For a 
conventional CPM system, we have c„ir{t) — and £,{21 + 1) = ■|d2i+i-7- 

In Section 12. 2[ we derive the conditions for a CPM with three transmitting antennas and introduce adequate 
mappings and a family of correction factors. In Section we detail some properties of the code. In Section [TH we 
benchmark the code by running some simulations and finally, in Section [2.51 some conclusions are drawn. 



2.2 Parallel Codes (PC) for 3 antennas 
2.2.1 Orthogonality 

In this section we describe how to enforce L"^ orthogonality on CPM systems with three transmitting antennas. 
Similarly to I14j, we impose orthogonality by 

(3i+3)T 



J s{t)s"{t)dt^ Esl (27) 



31T 



where I is the 3x3 identity matrix. Hence the correlation between two different Tx antennas Sm.r{t) and Sm'.r(i) is 
canceled over a complete STC block if 

(3i+3)T 

Sm,r(iX'.Wdt = (28) 



ZIT 

with m ^ m' . Now, by using Eq. (p4|l and (|25|l we get 

(29) 

The phase memory 9m{i^ + t) is time independent and therewith can be moved to a constant factor in front of the 
integrals. Similarly to [14| . we introduce parallel mapping {dm^} — cfjyl^\) for the data symbols and repetitive mapping 
{cm,r{t) — Cm,r'{t)) for the Correction factors. The integral on three time slots can then be merged into one time 
dependent factor. Furthermore, we obtain a second, time independent factor from the phase memory. Now, by using 
Eq. (pS)) one can see that the condition from Eq. (^5)1 is fulfilled if 

= 1 + exp(jai) + exp(jai) exp(ja2) (30) 

where — 2t: [Cmi'^l + r) — ^m'(3' + '")] f^nd we get — exp(— joi) = 1 + exp(ja2). By splitting this equation into 
imaginary and real parts, we have the following two conditions: 

— 1 = cos(— fli) + cos(a2) (31) 
= sin(— oi) + sin(a2). (32) 

This system has, modulo 27r, two pairs of solutions 

(ai, 02) e {(2^3, 27r/3), (4^3, 47r/3)}. (33) 

Hence orthogonality is achieved if £m{3l + r) — £_rn' (3Z + r) = 1/3 or £_,n{il + r) — ^,„/ {31 + r) = 2/3 for r = 1, 2 
and for all combinations of m and m' with m ^ to'. In order to determine ^m(3? + ^)j we detail in the following section 
the exact mapping and the correction factor. 
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Figure 4: Mapping of the data sequence to the data symbols 



2.2.2 Mapping 

In this section we describe the mapping of the data sequence dj to the data symbols dm^J of the block code (Fig. 
HI). To obtain full rate each code block have to include three new symbols from the data sequence. In general, the 
mapping of the three new symbols has no restrictions. However, to fix a mapping two criteria are considered: 

• mapping to simplify Eq. (|29p 

• low complexity of function ^,,,.(3/ + r). 

The first criteria is already determined by using parallel mapping {d^^} = d^^\). Therewith the mapping for the 
m-dimension (Fig. |4|) is fixed. For the remaining two dimensions we choose a mapping similar to conventional CPM. 
The subsequent data symbols in r-direction are mapped to subsequent symbols from the data sequence. Also similar 
to conventional CPM we shift this mapping by —i and obtain 

d'^^} = d^l+r-^+l. (34) 

Eq. ([35| and (|36|) show the simplification of the function ^m(3/ + r). 

2.2.3 Correction Factor 

The choice of the phase memory and therewith of the function ^m(3/ + ?') ensures the continuity of phase. If <j)m,r{{Lil + 
r)T) — (j)m,r+i{{Ltl + r)T), we always obtain the desired continuity. Hence, 

7 7 
UiLtl + r)= hY.dlk'lqm + ^raA{^l + r)T) - hY.d'^nl+iliiT) - c„,,+i((3? + r)T). (35) 

i=l i=l 

With a mapping similar to conventional CPM (Section l2.2.2l) we can simplify the two sums to a single term and obtain 
for r = 1,2 

Cm(3/ + r) = 2'^3J+r-7+i + Cm,r((3/ + r)T) - c„,r+i((3^ + r)T). (36) 

As the data symbols are equal on each antenna, the difference between two different ^m(3Z + r) does not depend on 

the data symbol ^3/+^ Thus, when choosing parallel mapping, orthogonality only depends on the correction 

factor. 

To fulfill Eq. (j30|) for all antennas we take 

• for m = 1, m' = 2 

flr = Y = 2^[ci,,((3? + r)T) - ci,,+i((3/ + r)T) ~ c^^i^l + r)T) + C2,,+i((3? + r)T)], (37) 

• for m = 2, to' = 3 

a. - y = 2^[c2,.((3Z + r)T) - C2,.+i((3/ + r)T) - C3..((3; + r)T) + C3,.+i((3? + r)T)] (38) 



9 



• and for m — 1, m' ~ 3 we consequently get 
47r 

ar = Y = 2^[ci,,((3/ + r)T) - ci,,+i((3? + r)T) - C3,.((3/ + r)r) + C3,.+i((3/ + r)T)]. (39) 

The other three possible combinations of m and m' with m ^ m' lead only to a change of sign and we get = 
— 27r/3, — 27r/3, — 47r/3, respectively. Due to the modulo 2n character of our condition, these are also vahd solutions. 

For simplicity, we assume similar correction factors for each time slot r of one Tx antenna Cm,iit) — Cm,2{t) = 
Cm,3{t)- Since Eq. ((5^ arises from Eq. ((S7)) and (jHS]) . we have two equations and three parameters: ci,r(i), C2,r(^) 
and C3_r(i). Hence we define C2,r(t) = for r = 1,2,3 and we get ci,r((3Z + r)T) — ci,r+i((3/ + r)T) = 1/3 and 
C3,r((3/ + r)T) — C3_r.+i((3/ + r)T) — —1/3 for r = 1,2. Codes fulfilling these conditions form the family of Parallel 
Codes (PC). We will now describe some possible solutions of this family. 

An obvious solution for the correction factor is obtained for all functions which are for t — {31 + r)T and ±1/3 
ioYt= {il + r + l)T, e.g. 

/^ 2 t-(3/ + r)r 
ciAt) = -C3,r-(t) = \^ ' (40) 

for (?>l + r)T < t < {31 + r + 1)T. We denote this solution as linear parallel code (linPC). Of course, other choices are 
possible, e.g. based on raised cosine (rcPC). 

Another way of defining the correction factor is 

ciAt) = -csAt) = - i'T) (41) 



1=1 



for (3/ + r)T < t < {31 + r + 1)T. In that case we take advantage of the natural structure of CPM, i.e. in Eq. ([55)) 
all except one summands cancel down, similar to the terms with the data symbols. This definition has the advantage 
that we can merge the correction factor and the data symbol in Eq. (|25p and we obtain two pseudo alphabets shifted 
by an offset (offPC) for the first and third transmitting antenna 
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M + 1- 
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" 3h'' 


-MA 


'^~3h'-- 


. . , M - 1 - 


2 

" 3h 



^d, -- 
^d, -- 

Consequently, this i^-orthogonal design may be seen as three conventional CPM signals with different alphabet 
sets fid, f^diand fld^ for each antenna. In this method, the constant phase offsets introduce frequency shifts. But as 
shown by the simulations in next section, these shifts are quite moderate. 

2.3 Properties of PC CPM 
2.3.1 Decoding 

The optimal receiver for the proposed codes relies on the computation of a metric over complete ST blocks followed 
by a maximum-likelihood sequence estimation (MLSE). 
Here, the metric is evaluated using the norm 

(3i+3)T g 

Dl = J \yi,r{t) - ai^„iS,n,r{t)\^ dt. (42) 

31T 

For convenience, we use here only one receiving antenna but the extension to more than one is straightforward. The 
distance in Eq. is obtained for all pAf^^^*"^ possible variations of s„i^r{t) corresponding to the paths of the 

trellis. 

The number of states can be reduced in two ways. First, by using the orthogonality property of the proposed code, 
all cross-correlations in Eq. ([1^ are canceled out and we obtain 

3 3 (3i+r+l)T 



-02=^^ / \yi,r{t) - ai^rnSrn,r{t)f dt. (43) 



m=Or=0 (3;^^^^ 
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(a) offPC (b) linPC 



Figure 5: Simulation results for the BER with different initial phases 9i{0) 

We only have to consider pM^ paths for every Sm,rit). The complexity of computing the distance is therewith L^pM^ 
which corresponds to the necessary effort to decode three symbols of three CPM signals. 

Second, by taking advantage of the parallel mapping we are not forced to decode block-wise. We can compute the 
distances symbol-wise with 

(/3+l)r 3 

Dy,^ j \yi,r{^) - ai^rnSmA^)\^ '^'t (44) 

l^jrp m— 1 

with r = {(3 mod Lt) + 1. Similarly to conventional CPM, Sm,r{t) has pAH possible values which have to be evaluated 
for every antenna m. By doing that, we reduce the paths of the trellis but, at the same time, we increase the number 
of transitions in the trellis. 

The number of paths can be further reduced by using some special properties of CPM. There exist numerous 
efficient algorithms for MLSE. However, the efficiency of the detection algorithm is not in the primary scope of this 
report and will be the subject of another upcoming paper. 



2.3.2 Diversity 

Simulations of the proposed code show a similar behavior as codes with full diversity. But, in contrast to orthogonal 
code for two antennas [13], the diversity of the three antenna code depends on the initial phase 9i{0) of each antenna 
i. Figure [5] shows the simulation results for 6*3(0) = and varying 6*1(0) and ^2(0). The bit error rate (BER) clearly 
depends on the choice of the initial phase. This effect is different for offset PC (Figure 5(a)) and linear PC (Figure 



5(b)). 



The shown simulation results are using 4-array CPM (M — 4) at Eb/No — 13dB. Further simulation with M — 8 
show no difference in the location of the minimal BER. Whereas the varying modulation indexes h slightly change the 
position of the minima. 



2.4 Simulations 

In this section we test the proposed algorithms by running MATLAB simulations. More precisely, we benchmark the 
offset parallel code (offPC) for two and three Tx antennas and the linear parallel code (linPC) for three Tx antennas. 
For all simulations we use a Gray-coded CPM with a modulation index of 1/2, an alphabet of 2 bits per symbol 
(M = 4) and a memory length 7 of 2. We use a linear phase smoothing function q{t) (2REC). Corresponding to 
sectionHXl we use 6*1(0) = 0.75, 6*2(0) = 15 and 6*3(0) = for linPC and 6*1(0) = 0.45, 6*2(0) = 0.1 and 6*3(0) = for 
offPC. 

The modulated signals are transmitted over a frequency flat Rayleigh fading channel with complex additive white 
Gaussian noise. The fading coefficients an,m are constant for the duration of a code block (block fading) and known 
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Figure 6: Left: Simulated bit error rate (BER) over a Rayleigh fading channel; Right: Power spectral density of the 
linPC analyzed with the Welch algorithm. 

at the receiver (coherent detection) . To guarantee a fair treatment of single and multi antenna systems the fading has 
to have a mean value of one. 

The received signal yn{t) is demodulated by the methods introduced in Section [2.3.1l Both require approximately 
the same computational effort and achieve the same performance. The evaluated distances are fed to the Viterbi 
algorithm (VA), which we use for MLSE. In these demodulation methods, the trellis which is decoded by the Viterbi 
algorithm has pM^^^ states and M paths leaving each state. In our simulations, the Viterbi algorithm is truncated 
to a path memory of 10 code blocks, which means that we get a decoding delay of 3 ■ lOT. 

Figure [5] shows the simulations results for one, two and three transmitting antennas. It can be seen that full 
diversity is probably achieved and that linPC and offPC perform equal well if the optimal initial phase is chosen. 

A main reason for using CPM for STC is the spectral efficiency. Figured] shows the simulated power spectral density 
(psd) obtained by the analysis of Sm,r{t) with the Welch algorithm. The psd of the offPC has a negligible difference 
compared to the linPC. Consequently it is not plotted in Figure El The second Tx antenna uses a conventional CPM 
signal without correction factor and hence shows an equal psd. The spectra of the other antennas are shifted due to 
the additional offset Cmr{t) with a non zero mean. Minimizing the L^-norm of the difference between the unshifted and 
shifted spectra result in a phase difference of ±0.19 measured in normalized frequency / • T^, where Td = T / log2{M) 
is the bit symbol length. Compared to the frequency offset of 0.375 appearing for two L^-orthogonal antennas, the 
three antenna system requires approximately the same bandwidth. 

2.5 Conclusion to part two 

In this part, we introduce a new family of L^-orthogonal STC for three antennas. These systems are based on CPM 
supplemented by correction factors to ensure L^-orthogonality. Structurally the proposed code family has full rate and 
we expect full diversity. Furthermore, we detail two simple representatives of the code family (offPC, linPC), where 
the offPC offers better performance and a very intuitive representation. By analyzing the power spectral density, it 
is also shown, that the extension of the bandwidth, caused by the correction factor, is small. Therefore the power 
efficiency of CPM is maintained. 

General conclusion 

In this report, we detail the construction and analyze the properties of L2-orthogonal STC-CPM for two and three 
transmitting antennas. These codes are attractive due to low-effort-decoding and the few restrictions the code-family 
set upon parameters of CPM. Also, the simulation results show the importance of optimizing the initial phases for an 
efficient design and an optimal use of parallel codes. 
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